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Prelaw 


This  thesis  presumes  that  the  reader  possesses  a 
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Abstract 

The  purpose  of  this  study  was  to  develop  an  algorithm 
which  could  produce  from  a given  symbol  set  a new  symbol 
set  that  would  be  optimally  separable  by  human  subjects. 

A symbol  change  algorithm  was  developed  based  on  the  Fou- 
rier transform  model  of  the  human  visual  system  and  the 
inverse  relationship  between  the  number  of  human  confu- 
sion errors  that  result  from  the  basic  shapes  of  symbols 
and  the  Euclidian  distances  between  these  symbols  in  the 
Fourier  domain.  The  algorithm  effectively  changes  the 
shape  of  certain  symbols  of  a given  symbol  set  by  con- 
trolled manipulation  of  the  symbol  in  the  transform  do- 
main until  a minimum.  Euclidian  distance  threshold  is 
achieved  between  the  changed  symbol  and  all  other  symbols 
in  the  set.  A symbol  set  conf usability  number  was  also 
developed  which  can  be  used  to  evaluate  different  symbol 
sets  with  the  lower  number  identifying  the  least  confus- 
able  symbol  set.  The  author  believes  that  by  applying 
this  algorithm  for  a given  symbol  set  the  number  of  human 
confusion  errors  that  occur  due  to  the  shapes  of  these 
given  symbols  will  be  effectively  reduced. 
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AN  ALGORITHM  TO  GENERATE  AN  OPTIMALLY 
HUMAN-SEPARABLE  SYMBOL  SET 


I.  Introduction 

The  number  of  two  dimensional  symbolic  codes  used  to 
display  various  meanings  to  weapons  systems  operators  is 
increasing  as  more  complex  weapons  systems  are  developed. 
The  addition  of  each  new  symbolic  code  often  creates 
some  confusion  for  the  systems  operator  who  must  be,  or 
has  been,  familiar  with  more  than  one  weapons  syst> 

This  becomes  especially  critical  in  a combat  environment 
in  which  the  systems  operator  must  be  able  to  rapidly 
differentiate  between  various  symbol  meanings. 

The  need  for  standardization  in  developing  symbolic 
codes  for  a variety  of  weapons  systems  was  stated  quite 
well  by  Honigfeld  in  1964i  "The  need  for  a standard 
symbology  is  highlighted  by  the  fact  that  each  contrac- 
tor who  develops  a radar  system  has,  in  the  past,  been 
allowed  to  arbitrarily  select  a symbol  code  and  its 
meaning  for  display  use.  Since  symbols  have  not  been 
specified  formally,  the  result  is  a unique  code  for  each 
system.  Symbol  meanings  differ  from  system  to  systemi 
identical  meanings  might  be  represented  on  one  display 
by  numbers,  on  another  by  letters,  and  on  a third  by  ge- 
ometric forms"  (Ref  12il). 
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In  addition  to  confusing  the  meanings  of  symbols  in 
one  set  with  those  of  another  set,  the  systems  operator 
may  also  confuse  the  individual  symbols  within  one  set 
with  each  other.  In  1973  a group  from  the  Aerospace 
Medical  Research  Laboratory  (AMRL),  Wright-Patterson  Air 
Force  Base,  Ohio,  studied  the  standardization  problem  of 
symbol  coding  and  range  presentation  in  a high  threat 
environment.  Although  an  "optimum"  symbol  set  was  not 
developed,  the  team  recommended  that  a given  symbol  set 
should  use  a flickering  circle  to  highlight  critical 
threats  and  that  the  alphabetical  form  of  the  letter  "A" 
should  be  replaced  by  a more  distinct  geometrical  form. 
If  this  latter  suggestion  was  implemented,  the  systems 
operator  would  avoid  confusing  the  letter  "A"  with  the 
number  "4"  (Ref  l4i 27-32). 

An  engineering  model  of  the  human  visual  information 
processing  system  proposed  by  Kabrisky  in  1966  (Ref  13) 
and  a subsequent  psychological  study  by  Goble  in  1975 
(Ref  10)  suggest  an  approach  to  explain  the  degree  of 
confusion  between  symbols  such  as  the  letter  "A"  and  the 
number  "4"  in  the  AMRL  symbol  sets.  This  approach  uses 
a two  dimensional,  discrete  Fourier  or  Walsh  transform 
model  for  pattern  recognition  and  provides  the  basis  for 
this  study  which  involves  the  controlled  manipulation  of 
symbols  in  the  transform  domain  to  "create"  new  symbols 
that  are  optimally  separable  by  human  subjects. 
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II.  Background 


Kabrisky  proposed  a human  pattern  recognition 
model  in  1966  based  on  Fourier  analysis.  He  theorized 
that  the  rich  neural  interconnectivity  of  the  visual 
cortex  could  save  a significant  number  of  basic  compu- 
tational elements  by  transferring  data  with  a two  di- 
mensional Fourier  transform  (Ref  13«82).  Using 
Kabrisky* s work,  Radoy,  in  1967,  showed  that  the  en- 
tire English  alphabet  could  be  correctly  classified 
from  the  information  contained  in  the  fundamental  spa- 
tial frequency  component  and  the  first  two  harmonics 
(Ref  16).  Based  on  Kabrisky's  model  of  the  human  vis- 
ual system  and  Radoy *s  Fourier  transform  approach  for 
a pattern  recognition  machine,  further  work  by  Tallman 
in  1969  (Ref  17),  Carl  in  1969  (Ref  3)*  and  Granlund  in 
1970  (Ref  11i195“201)  more  closely  approximated  human 
performance  in  processing  and  classifying  handprinted 
characters  by  machine  (greater  than  95*  accuracy). 

In  1973  an  Aerospace  Medical  Research  Laboratory 
team  studied  problems  associated  with  the  standardiza- 
tion of  symbolic  coding  used  to  display  information  to 
a weapons  systems  operator  in  a high  threat  environ- 
ment. They  found  that  certain  symbols  were  more  like- 
ly to  be  confused  with  each  other  and  recommended  that 
some  of  the  more  confusable  symbols  should  be  either 


replaced  or  changed  (Ref  14 131,32).  In  that  same  year, 
Thomas  applied  Kabrisky's  more  highly  refined,  Fourier 
(or  Walsh)  transform,  pattern  recognition  model  of  the 
human  visual  system  to  develop  a predictor  of  human 
performance  in  certain  symbol  recognition  tasks,  con- 
centrating on  threat  symbols  (Ref  19). 

However,  not  until  Goble's  dissertation  in  1975 
had  the  Euclidian  distances  between  symbols,  charac- 
ters or  patterns  resulting  from  the  transform  model 
been  linked  to  the  human  error  rate.  Goble  indicated 
that  "as  an  inadvertant  concomitant  to  producing  a 
pattern  recognition  device  which  handles  handprinted 
alphabetic  characters,  a matrix  of  Euclidian  distances 
is  produced.  Each  prototype  letter  has  an  Euclidian 
distance  between  it  and  all  other  class  prototypes. 
Using  a third  harmonic,  low-pass  filter  (7x7)  estab- 
lishes a 49  space  domain  in  which  each  filtered  trans- 
form prototype  is  defined  by  one  point  in  this  space. 
Euclidian  distances  are  normalized  so  the  distance  of, 
for  instance,  prototype  A to  prototype  B may  serve  as 
an  indication  of  the  degree  of  error  or  confusability 
in  the  pattern  recognition"  (Ref  10il9-20).  He  pro- 
posed that  an  inverse  relationship  exists  between  the 
Euclidian  distances  in  the  Fourier  or  Walsh  domain  and 
the  number  of  errors.  Thus,  smaller  Euclidian  dis- 
tances are  associated  with  a high  degree  of  error  or 
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confusion  and  larger  Euclidian  distances  are  relative- 
ly error  free. 

The  success  of  this  Fourier  or  Walsh  transform 
model  for  classification  of  visual  imagery  parallels 
the  technological  advances  made  in  digital  computers 
and  processing  techniques  (Ref  1 and  Ref  5).  The  two 
dimensioned  discrete  Fourier  transform  (an  orthogonal 
transform)  can  be  used  to  decompose  input  data  that 
represents  a visual  (spatial)  image.  This  decomposi- 
tion yields  a set  of  sinusoidal  Fourier  components 
that  has  preserved  both  the  distance  and  the  angle  be- 
tween vectors.  Thus,  all  of  the  information  in  the 
original  spatial  image  is  present  in  the  Fourier  (spa- 
tial frequency)  domain,  and  the  spatial  image  can  be 
reproduced  by  adding  the  correct  Fourier  components. 

If  the  higher  spatial  frequency  components  are  re- 
moved , the  basic  form  of  the  original  image  is  still 
preserved  by  the  filtered,  lower  spatial  frequency 
components  (Ref  3*52-52,  Ref  4,  and  Ref  9*92-103). 
Finer  details  of  the  imagery  appear  as  progressively 
higher  frequency  components  are  added  to  the  filtered 
transform  space.  This  property  of  image  processing 
and  classification  not  only  suggests  an  alternate  ap- 
proach to  the  classical  pattern  recognition  theory  of 
distinct  feature  extraction  but  also  facilitates  the 
computer  analyses  of  spatial  images. 


To  fully  employ  the  advantages  of  analysis  by  com- 
puter, the  spatial  image (s)  must  be  initially  repro- 
duced in  a digital  form.  Although  digital  representa- 
tion of  an  image  can  be  accomplished  in  a variety  of 
ways,  the  primary  method  utilizes  a computer  control- 
led, flying  spot  scanner  to  sample  the  intensity  of 
the  image  at  a finite  number  of  evenly  spaced  points 
(Ref  18 *12-1 3).  With  an  appropriate  threshold  inten- 
sity level  to  eliminate  noise,  the  image  can  be  rep- 
resented much  like  the  symbol  in  Figure  1.  A number 
of  computer  techniques  can  then  be  used  to  analyze 
such  digital  simulations. 

One  algorithm  that  performs  a form  recognition 
analysis  on  digitally  represented  input  data  has  been 
developed  by  Gagnon.  His  predictor  of  human  visual 
performance  (PRBVIP)  algorithm  provides  a quantitative 
measure  of  the  overall  suitability  of  a symbol  set 
based  on  the  Euclidian  distances  between  symbols.  The 
full  complement  of  computer  analysis  techniques  and 
options  available  in  PRBVIP  is  not  required  for  the 
initial  purposes  of  this  project.  However,  the  reader 
can  find  a more  complete  description  of  the  PRBVIP 
options  in  reference  number  seven. 

The  following  steps,  included  as  part  of  the  PRBVIP 
algorithm,  are  required  for  this  projecti 
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1.  Transform  each  digitally  represented  symbol 
into  the  spatial  frequency  domain  using  the  Cooley- 
Tukey,  two  dimensional,  discrete  Fourier  transform 
(Ref  5). 

2.  Filter  each  transformed  symbol  retaining  the 
lower  frequency  components  (Ref  1?» 32-37). 

3.  Compute  the  Euclidian  distance  between  each 
possible  pair  of  filtered,  transformed  symbols  (Ref 
10.15). 

The  results  of  this  distance  computation  are  used 
to  determine  the  overall  suitability  or  confusion  of 
the  given  set  of  symbols.  Another  algorithm  which  is 
described  in  Chapter  III  computes  the  confusability  of 
a symbol  set  and  can  be  incorporated  into  PREVIP  as 
another  analysis  option.  This  algorithm  uses  the  Eu- 
clidian distances  to  quantitatively  determine  the  sym- 
bol which  will  cause  the  most  discrimination  errors. 

A new  symbol  can  then  be  "created"  to  replace  this 
most  oonfusable  symbol  by  controlled  manipulation  of 
the  symbol  in  the  transform  domain.  The  algorithm  is 
repeated  until  a new  symbol  sot  that  is  optimally  sep- 
arable by  human  subjects  has  been  ereated. 
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III.  Symbol  Sll  C onf usability  gnfl. 
Symbol  Change  Algorithm 


Symbolic  coding  ie  used  in  a large  number  of  weap- 
ons systems  and  for  an  even  larger  number  of  mission 
profiles.  For  example , point-to-point  navigation, 
terrain  avoidance  aerial  delivery,  close  air  support, 
searoh  and  rescue,  airborne  intercept,  reconnaissance, 
missile  launch  and  recovery,  satellite  tracking,  and 
area  defense  are  but  a few  of  the  broader  mission  pro- 
files that  use  some  set  of  symbols  to  display  informa- 
tion to  the  weapons  systems  operator.  However,  a sin- 
gle weapons  system  may  be  designed  to  perform  more 
than  one  type  of  mission  profile,  and  usually  more 
than  one  weapons  system  can  be  used  to  complete  a sin- 
gle profile.  Furthermore,  if  each  contractor  is  per- 
mitted to  select  a symbol  set  of  his  own  choosing  to 
accompany  the  weapons  system  he  develops,  an  extraor- 
dinarily large  inventory  of  symbol  sets  results.  This 
produces  different  symbols  or  even  entire  symbol  sets 
that  represent  identical  meanings.  For  the  weapons 
systems  operator  who  must  perform  various  tasks  with 
one  or  more  systems,  much  confusion  between  symbols 
can  result  which  leads  to  errors,  especially  under 
conditions  of  high  stress.  Therefore,  an  "optimum" 
symbol  set  (one  whioh  produces  no  discrimination  er- 
rors between  symbols)  is  needed  to  rapidly  display 
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exact  meanings  to  the  operator.  The  Euclidian  distances 
that  result  from  the  predictor  of  human  visual  perform- 
ance (PREVIP)  algorithm  described  in  Chapter  II  can  be 
used  to  develop  this  "optimum"  symbol  set  from  a given 
input,  digitally  represented,  symbol  set. 

Bgprmntfttlvg  symbol  sg$g 

The  first  step  needed  in  the  development  of  a symbol 
change  algorithm  for  this  research  project  is  to  select 
the  symbols  that  provide  a representative  symbol  set  for 
current  and  future  weapons  systems  and  mission  profiles. 

Pour  sets  of  18  symbols  each  adequately  span  the  more 
common  mission  profiles  for  most  weapons  systems.  These 
symbol  sets  are  depicted  in  Figure  2. 

Symbol  set  one  is  composed  of  symbols  that  are  ex- 
tracted from  high  altitude  and  low  altitude  enroute  and 
terminal  charts  prepared  by  the  Defense  Mapping  Agency 
Aerospace  Center  (Ref  6).  The  symbols  of  set  one  are 
used  primarily  for  navigational  purposes.  Symbol  set 
two  consists  of  a typical  set  of  cathode  ray  tube  sym- 
bols used  to  represent  cartographic  points  in  a high 
threat  or  combat  environment  but  is  easily  adapted  to 
a large  variety  of  mission  profiles.  Symbol  set  three 
is  developed  from  symbols  and  modifiers  used  by  Thomas 
in  his  recognition  tests  (Ref  19’ 20).  Symbol  set  four 
is  composed  of  symbols  taken  from  a National  Aeronautics 
and  Space  Administration  symbology  character  set  used 
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Figure  2.  Symbol  Sets  Used  in  Symbol  Change' Algorithm 


11 


for  Electronic  Attitude  Direction  Indicator  (EADI)  dis- 
plays (Ref  20(129). 

Digital  glmlltlttn 

After  the  symbol  sets  are  chosen,  they  are  digit- 
ised using  a coaputer  controlled  flying  spot  scanner 
(Digital  Equipaent  Corporation  PDP-12  general  purpose 
coaputer).  For  this  project,  all  of  the  syabols  are 
sise  normalised  so  that  the  largest  diaension  of  each 
syabol  is  one  inch.  The  only  exception  to  this  normal- 
isation is  in  syabol  set  two  where  a half  inch  square 
is  used.  The  syabols  are  centered  in  a two  inch  square 
and  scanned  on  a grid  of  varying  sise  with  the  flying 
spot  scanner.  A 64x64  point  grid  is  used  for  the  data 
in  this  study.  The  subsequent  intensity  pattern  can  be 
written  on  a Magnetic  tape  and  a threshold  applied  us- 
ing the  Control  Data  Corporation  (CDC)  6600  coaputer  to 
provide  a two  level  intensity  pattern  (black  on  a white 
background)  froa  which  the  noise  (undesired  intensity 
level)  is  renoved.  Bach  digitally  represented  syabol 
can  be  punched  on  coaputer  cards  in  octal  foraat  for 
use  as  input  data  with  PKBVIP. 

A higher  resolution  is  obtained  by  using  the  64x64 
point  grid  rather  than  a 32x32  point  or  saaller  also 
grid  since  the  syabols  are  about  half  the  site  of  the 
window  (the  two  inoh  square).  Also,  edge  effects  due 
to  the  induoed  periodicity  of  the  Fourier  transfora  are 
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minimised  by  employing  the  larger  grid.  However,  a 
larger  computer  storage  capacity  is  required  than  is 
available  with  the  PDP-12  computer.  Therefore,  the  CDC 
6600  computer  is  needed  and,  rather  than  use  a large 
block  of  permanent  file  storage,  the  data  is  stored  on 
computer  cards. 

PlBtMlgJ  Ma&Clfilft 

Operating  with  the  digitized  symbols  as  input  data, 
task  one  of  PREVIP  (Ref  7)  is  used  to  calculate  the  Eu- 
clidian distances  between  each  possible  pair  of  input 
symbols  and  arrange  these  distances  in  matrix  form. 

Using  the  special  PREVIP  nomenclature,  a "field  of  view" 
of  four  degrees  by  four  degrees  represents  the  64x64 
point  grid  with  a "size  of  interest"  parameter  of  two 
cycles  per  degree.  Since  the  fundamental  spatial  fre- 
quency is  the  reciprocal  of  the  field  of  view  in  the 
direction  of  interest,  the  data  is  analyzed  with  a fun- 
damental frequency  of  0.25  cycles  per  degree  in  both 
dimensions.  The  size  of  interest  parameter  determines 
the  weight  assigned  to  each  spatial  frequency  component 
according  to  the  following  three  equations  1 

FOA  *30«  ln(6*SI) 

where  POA  is  the  "field  of  attention"  and  SI  is  the 
size  of  interest. 
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VFXT7~FY 


8 


x C 


-FOA/JO 


(2) 


where  FN  is  the  normalized  spatial  frequency,  FX  is  an 
integer  that  corresponds  to  harmonics  of  the  fundamen- 
tal spatial  frequency  in  the  x direction,  and  FY  is  an 
integer  that  corresponds  to  harmonics  of  the  fundamen- 
tal spatial  frequency  in  the  y direction. 


WEIGHT  = FN  x e‘-FN  <■> 

where  WEIGHT  is  the  resulting  weighting  factor  assigned 
to  each  spatial  frequency  in  the  y direction. 

Bqs  (1),  (2),  and  (3)  apply  a human  modulation 
transfer  function  to  filter  the  data  in  the  Fourier  do- 
main. This  is  approximately  equivalent  to  a 7x7  low- 
pass  filter  except  for  weighting  factors  of  unity  for 
the  second  harmonic  and  nearly  one  for  the  fundamental 
and  third  harmonic.  The  weighting  factor  then  de- 
creases rapidly  as  the  frequency  increases.  The  dis- 
tance matrix  is  computed  from  these  filtered  Fourier 
components.  This  distance  matrix  is  then  rearranged  so 
that  eaoh  row  of  distanoes  are  in  order  of  increasing 
magnitude  from  the  reference  symbol.  For  example,  a 
five-symbol  set  is  shown  in  Figure  3 with  the  corre- 
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Table  I 

Distance  Matrix 


DIR 

DIR 

.0000 

NAV 

.7978 

ITX 

.2126 

VTC 

.8330 

MRP 

.8550 

NAV 

.7978 

.0000 

.7894 

.3520 

.2980 

ITX 

.2126 

.7894 

.0000 

.8191 

.8371 

VTC 

.8330 

.3520 

.8191 

.0000 

.3116 

MRP 

.8550 

.2980 

.8371 

.3116 

.0000 

Table  II 

Ordered  Distance  Matrix 


DIR 

DIR 

.0000 

ITX 

.2126 

NAV 

.7978 

VTC 

.8330 

MRP 

.8550 

NAV 

NAV 

.0000 

MRP 

.2980 

VTC 

.3520 

ITX 

.7894 

DIR 

.7978 

ITX 

ITX 

.0000 

DIR 

.2126 

NAV 

.7894 

VTC 

.8191 

MRP 

.8371 

VTC 

VTC 

.0000 

MRP 

.3116 

NAV 

.3520 

ITX 

.8191 

DIR 

.8330 

MRP 

MRP 

.0000 

NAV 

.2980 

VTC 

.3116 

ITX 

.8371 

DIR 

.8550 
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•ponding  distance  matrix  and  ordered  distance  matrix 
depicted  in  Tables  I and  II  respectively. 

It  is  the  ordered  distance  matrix  from  PREVIP  that 
is  used  to  sequentially  compute  the  symbol  set  confuse- 
bility  number*  determine  the  most  confusable  symbol  in 
the  set*  and  enter  the  symbol  change  algorithm  which  al- 
ters the  basic  form  of  the  most  confusable  symbol  to 
achieve  a less  confusable  symbol  and  a symbol  set  which 
causes  fewer  discriminability  errors. 


SXS&fil  Sll  Conf usability  Nuafegf 

The  symbol  set  confusability  number  (SSC)  is  defined 
to  be 


SSC 


i 

NS (NS- 1) 


NS  NS 

in 

i*l  j«l 


(4) 


where  i is  an  integer  that  corresponds  to  a particular 
reference  symbol  and  row  of  the  ordered  distance  matrix* 
j is  an  integer  that  corresponds  to  a column  of  the  or- 
dered distance  matrix*  NS  is  the  number  of  symbols  in 
the  symbol  set*  and  DS  is  the  Euclidian  distance  between 
the  ith  and  jth  symbols.  If  DSjj  is  less  than  0.001*  it 
is  set  equal  to  0.001  to  keep  SSC  finite. 

Individual  Symbol  Conf usability  Number.  Individual 
symbol  confusability  numbers  (ISCs)  are  calculated  from 
the  ordered  distance  matrix  according  to 


(5) 
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where  i,  j,  and  DS  are  as  defined  for  Eq  (4).  Since  the 
Euclidian  distances  are  computed  from  mappings  on  an  NS 
dimensional  sphere  of  unit  radius,  the  maximum  distance 
between  any  two  symbols  is  two.  Conversely,  the  minimum 
distance  is  zero.  Therefore,  the  range  of  ISC  is  from 
zero  for  ideally  separated  symbols  with  a distance  of 
two  to  999.5  for  identical  symbols  with  a distance  of 
zero. 


Reference  SymfrQl  Conf usability  Number.  To  represent 
the  confusability  of  one  symbol  with  respect  to  the 
other  symbols  in  the  set,  the  individual  symbol  confusa- 
bility numbers  are  used  in  the  following  equation  to 
calculate  a reference  symbol  confusability  number  (RSC)« 


RSq,  - 1 isc; 


(6) 


where  i,  j,  and  NS  are  as  defined  for  Eq  (4).  The  ref- 
erence symbol  confusability  numbers  are  stored  in  a sym- 
bol set  confusability  matrix  (SC  matrix).  An  example  of 
an  SC  matrix  is  shown  in  Table  III  which  is  derived  from 
the  ordered  distance  matrix  in  Table  II. 


Table  III 

Symbol  Set  Confusability  Matrix 


4 

3 

2 

1 

DIR 

6.3272 

2.1235 

1.3701 

.6696 

NAV 

6,7168 

3.8611 

1.5202 

.7534 

ITX 

6.3859 

2.1822 

1.4155 

.6946 

VTC 

6.4715 

3.7622 

1.4213 

.7005 

MRP 

6.9291 

4.0734 

1.3642 

.6696 

The  RSC  in  the  highest  numbered  column  of  the  SC 
matrix  represents  the  confusability  of  the  reference 
symbol  with  respect  to  all  other  symbols  in  the  set. 

The  RSC  in  the  next  highest  numbered  column  represents 
the  confusability  with  respect  to  all  other  remaining 
symbols  in  the  set  after  the  closest  symbol  to  the  ref- 
erence symbol  (nearest  neighbor)  has  been  deleted.  The 
RSC  in  the  next  column  reflects  the  deletion  of  the  two 
closest  symbols  to  the  reference  symbol  and  so  on.  The 
number  of  the  column  in  the  SC  matrix  corresponds  to 
the  number  of  symbols  with  which  the  reference  symbol 
is  compared.  The  confusability  of  the  reference  symbol 
with  respect  to  the  other  symbols  of  the  set  increases 
as  the  RSC  in  the  first  column  increases.  However, 
care  must  be  taken  when  comparing  reference  symbol  con- 
fusability numbers  in  other  columns  since  the  deleted 
symbols  will  not  follow  the  same  order  for  all  rows. 

The  comparison  is  only  valid  when  the  same  symbols  have 
been  deleted  for  each  reference  symbol. 

The  symbol  set  confusability  number  is  the  normal- 
ized svun  of  the  reference  symbol  confusability  numbers 
in  the  first  column  of  the  SC  matrix  and  can  be  used  to 
evaluate  the  overall  suitability  or  separability  of  a 
given  symbol  set.  The  SSC  for  the  five-symbol  set  cor- 
responding to  the  SC  matrix  in  Table  III  is  1.641525. 
The  range  of  SSC  is  from  zero  for  an  ideal  symbol  set 
where  all  distances  are  two  to  999.5  for  a symbol  set 
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of  identical  symbols  where  all  distances  are  zero.  Then 
for  two  or  more  given  symbol  sets,  the  set  with  the  lower 
SSC  is  the  most  discernible,  least  conf usable  symbol  set. 

M.ff.s,t  Conf  usable  Symbol 

The  ordered  distance  matrix  and  the  symbol  set  con- 
fusability  matrix  also  provide  the  basis  for  two  differ- 
ent methods  of  defining  the  most  conf usable  symbol.  The 
method  using  the  ordered  distance  matrix  selects  the  most 
confusable  symbol  from  one  of  the  pair  (or  perhaps  more) 
of  symbols  with  the  smallest  distance  in  column  two. 

From  this  symbol  pair  (or  group),  the  most  confusable 
symbol  is  that  symbol  with  the  smallest  distance  in  col- 
umn three  of  the  ordered  distance  matrix.  If  more  than 
one  of  these  symbols  has  the  same  distance  in  column 
three,  the  procedure  is  repeated  for  the  pair  (or  group) 
of  symbols  by  comparing  the  distances  in  the  next  column 
to  the  right  until  a most  confusable  symbol  is  deter- 
mined. If  the  last  column  of  the  ordered  distance  matrix 
is  checked  and  more  than  one  symbol  remains,  then  these 
symbols  are  identical  symbols,  and  the  most  confusable 
symbol  is  any  one  of  these  remaining  symbols.  The  most 
confusable  symbol  then  represents  that  symbol  of  the 
given  symbol  set  which  results  in  the  most  errors  in  a 
pair-wise  discriminability  test. 

Another  method  to  define  the  most  confusable  symbol 
chooses  the  symbol  with  the  highest  RSC  in  the  first 
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column  of  the  SC  matrix.  If  the  reference  symbol  con- 
fusability  numbers  in  the  first  column  for  two  or  more 
symbols  are  equal  and  also  the  highest,  then  the  rows 
of  the  SC  matrix  corresponding  to  those  symbols  are 
checked  for  the  highest  RSC  in  the  second  column.  If 
two  or  more  symbols  still  remain,  the  procedure  is  re- 
peated for  each  succeeding  column  until  a most  conf us- 
able symbol  is  determined.  If  two  or  more  rows  of  the 
last  column  of  the  SC  matrix  are  checked  and  the  refer- 
ence symbol  conf usability  numbers  are  equal,  then  these 
symbols  are  identical  symbols  and  the  solution  is  triv- 
ial. The  most  conf usable  symbol  by  this  method  repre- 
sents that  symbol  of  the  given  symbol  set  which  is  more 
easily  confused  when  compared  with  all  other  symbols  in 
the  set. 

Por  most  symbol  sets,  the  two  methods  described 
above  will  yield  the  same  most  confusable  symbol  for 
the  set.  However,  in  certain  instances  where  two  sym- 
bols in  a given  set  are  identical  or  nearly  identical 
(small  Euclidian  distance  between  the  symbols)  and  are 
quite  discernible  from  all  other  symbols  in  the  set 
(relatively  large  Euclidian  distances  to  the  other  sym- 
bols), it  is  possible  for  each  method  to  produce  a dif- 
ferent most  confusable  symbol.  The  five-symbol  set 
shown  in  Figure  3 is  such  a symbol  set.  The  first 
method  determines  that  the  ITX  symbol  is  the  most  con- 
fusable symbol,  while  the  second  method  shows  the  MRP 


20 


symbol  as  the  most  conf usable  symbol.  The  method  to 
use  depends  upon  what  the  most  confusable  symbol  Is  to 
represent.  In  most  cases,  the  Intent  of  changing  the 
most  confusable  symbol  is  to  reduce  the  number  of  er- 
rors associated  with  the  discriminability  between  sym- 
bols. Therefore,  normally  the  most  confusable  symbol 
is  determined  by  the  first  method  using  the  ordered 
distance  matrix. 

Symbol  Change  Theory 

Once  the  most  confusable  symbol  is  determined,  that 
symbol  is  changed  in  order  to  achieve  a less  confusable 
symbol  and  reduce  the  discriminability  errors  associ- 
ated with  the  given  symbol  Bet.  Three  different  ap- 
proaches to  change  this  symbol  are  investigated  for 
this  project.  The  first  approach  involves  subtracting 
the  Pourier  (spatial  frequency)  components  of  the  near- 
est neighbor  to  the  most  confusable  symbol  from  the 
weighted  Pourier  components  of  the  most  confusable  sym- 
bol. The  nearest  neighbor  is  that  symbol  in  the  second 
column  of  the  reference  row  in  the  ordered  distance  ma- 
trix using  the  most  confusable  symbol  as  the  reference 
symbol.  The  second  and  third  procedures  compare  corre- 
sponding spatial  frequency  components  of  the  most  con- 
fusable symbol  and  its  nearest  neighbor.  The  second 
method  reflects  the  theory  that  possibly  those  corre- 
sponding spatial  frequency  components  that  are  differ- 
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ent  in  value  provide  the  discemibility  between  the 
symbols.  Therefore,  if  those  components  that  are  dif- 
ferent are  made  more  different,  then  the  distance  be- 
tween the  two  symbols  will  increase  and  the  symbols 
will  be  more  discernible.  The  third  method  is  similar 
to  the  second  except  that  the  corresponding  spatial 
frequency  components  that  are  equal  or  nearly  equal  are 
made  different  in  value  while  those  components  that  are 
already  different  are  changed  none  or  very  little. 

This  procedure  should  also  increase  the  distance  be- 
tween the  symbols  and  make  them  more  discernible. 

Wgjfihtgfl  gaarangni  Approach.  The  weighted  Fourier 
component  approach  applies  the  following  equation  to 
change  the  most  conf usable  symbol i 

FC,.W  = ( FCU  . WT  - FC^  ) /WT  m 

where  FCn#w  is  the  new  Fourier  component  of  the  most 
confusable  symbol,  FC^^  is  the  old  Fourier  component 
of  the  most  confusable  symbol,  PCnl)r  is  the  correspond- 
ing Fourier  component  of  the  nearest  neighbor  to  the 
most  confusable  symbol,  and  WT  is  the  weighting  factor 
which  determines  how  much  the  Fourier  component  and, 
therefore,  the  symbol  will  change. 

As  WT  increases,  there  is  less  change  in  the  Fou- 
rier component.  For  this  project,  values  for  WT  have 
been  varied  between  one  and  20  with  a weighting  factor 
of  five  providing  the  most  controllable  change  in  the 
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symbol.  A weighting  factor  of  more  than  five  makes 
very  little  change  in  the  symbol.  A weighting  factor 
of  less  than  five,  however*  makes  too  large  of  a change 
in  the  Fourier  component  to  interpret  the  spatial  image 
of  the  changed  symbol  which  results  from  the  inverse 
Fourier  transform  using  the  filtered*  changed  Fourier 
components.  If  five  is  used  as  a weighting  factor  in 
Eq  (7)*  a definite*  controllable  change  in  the  symbol 
occurs  which  enables  reproduction  of  the  changed  symbol. 

Separate  Different  CoasaDgUlB . The  method  which 
takes  those  corresponding  spatial  frequency  components 
of  the  most  confusable  symbol  and  its  nearest  neighbor 
and  makes  more  different  those  components  that  are  al- 
ready different  uses  the  following  symbol  change  equa- 
tion* 

FC_  - FCm  ( 1 ♦ (FD  * WT))  <8) 

where  FCnew  and  FCQld  are  as  defined  in  Eq  (7),  FD  is 
the  absolute  value  of  the  difference  between  the  values 
of  FCQld  and  PCnbr  (defined  in  Eq  (7))#  and  WT  is  a 
weighting  factor  that  determines  the  amount  of  change 
in  the  Fourier  component. 

As  the  weighting  faotor  increases,  the  change  in 
the  spatial  frequency  component  also  increases. 

Weighting  factors  between  one  and  five  have  been  inves- 
tigated. A weighting  factor  of  unity  shows  a negligi- 
ble change  in  the  symbol  and  the  distance  to  its  near- 
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est  neighbor,  while  a greater  weighting  factor  in- 
creases the  distance  but  with  no  significant  change  in 
the  symbol.  This  is  caused  by  a change  in  those  Fou- 
rier components  that  contribute  energy  to  the  back- 
ground at  points  sufficiently  distant  from  the  symbol 
itself  to  make  no  effective  change  in  the  symbol  al- 
though this  additional  energy  does  affect  the  Euclidian 
distance  computation.  The  result  is  a very  smeared, 
filtered  image  of  the  changed  symbol. 

Separate  Like  Components . The  third  approach  which 
takes  the  corresponding  Fourier  components  of  the  most 
confusable  symbol  and  makes  like  components  different 
applies  the  following  symbol  change  equationi 

FC^  = FCoU(l  + ( FD^W-rj)  <») 

where  FCnew  and  FCold  are  as  defined  in  Eq  (7),  FD  is 
as  defined  in  Eq  (8)  but  ia  set  equal  to  0.01  in  order 
to  keep  FCnew  finite,  SIGN  is  positive  if  FCQld  is 
larger  than  FCnbr  and  negative  if  the  reverse  is  true, 
and  WT  is  a weighting  factor  which  determines  how  much 
the  component  will  change. 

The  symbol  ehange  becomes  much  more  pronounced  in 
this  case  as  the  weighting  factor  decreases.  Weighting 
factors  between  100  and  1000  have  been  investigated 
during  this  study  with  factors  over  150  showing  no  sig- 
nificant change  apparent  in  the  shape  of  the  symbol, 
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yet  a large  increase  in  distance  to  the  nearest  neigh- 
bor. This  is  again  caused  by  the  contribution  of  energy 
from  the  changed  Fourier  components  to  the  background  of 
the  symbol.  A weighting  factor  of  100  effectively 
changes  the  symbol  and  increases  the  distance  to  the 
nearest  neighbor,  however,  the  corresponding  change  to 
the  background  of  the  symbol  makes  the  recognition  of 
the  changed  symbol  extremely  difficult. 

symbol  Vffgtor  flgvmrrt 

The  next  problem  is  to  determine  how  much  to  change 
a given  symbol  using  any  one  of  Eqs  (7),  (8),  or  (9)  and 
how  to  control  the  direction  of  movement  of  the  symbol 
vector  in  Fourier  space  or  limit  the  distance  from  other 
symbols  in  the  set  while  increasing  the  distance  from 
the  nearest  neighbor.  An  arbitrary  value  of  0.5  is  used 
as  a threshold  Euclidian  distance  in  the  algorithm. 

This  figure  is  based  on  reviewing  the  distances  in  a 
number  of  different  symbol  sets  (mostly  those  sets  con- 
sisting of  letters  and  numbers)  and  seems  to  be  a large 
enough  distance  so  that  discriminability  errors  should 
be  quite  small,  if  any  do  exist.  The  appendix  includes 
a discussion  and  some  results  of  additional  research 
conducted  into  the  area  of  determining  an  error  free  Eu- 
clidian distance.  This  algorithm  changes  the  most  con- 
fusable  symbol  until  the  distance  to  its  nearest  neigh- 
bor is  equal  to  or  greater  than  the  threshold  distance. 
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The  direction  in  which  the  changed  symbol  moves  in 
relation  to  the  other  symbols  in  the  set  is  also  impor- 
tant. To  provide  controlled  movement  of  the  symbol* 
the  distances  corresponding  to  the  latest  change  in  the 
symbol  are  compared  to  those  distances  resulting  from 
the  previous  change.  If  any  of  the  previous  distances 
are  below  the  threshold  and  decrease  when  the  symbol  is 
changed*  then  a symbol  other  than  the  nearest  neighbor 
is  used  in  place  of  the  nearest  neighbor.  This  "new 
neighbor"  symbol  is  the  closest  symbol  to  the  most  con- 
fusable  symbol  disregarding  the  nearest  neighbor  sym- 
bol. If  again  any  distance  decreases  that  was  previ- 
ously below  the  threshold,  this  procedure  is  repeated 
until  all  other  symbols  are  exhausted. 

If  a direction  of  movement  is  found  in  which  all  of 
the  distances  below  the  threshold  increase  with  the 
change*  the  symbol  being  used  as  the  nearest  neighbor 
in  the  symbol  change  equations  is  retained  as  the  near- 
est neighbor  symbol.  The  symbol  changes  are  repeated 
until  the  minimum  distance  from  the  most  conf usable 
symbol  to  any  other  symbol  in  the  set  is  at  least  as 
great  as  the  threshold.  This  requires  at  least  one  it- 
eration and  is  dependent  upon  the  original  distance  be- 
tween the  reference  symbol  and  its  nearest  neighbor* 
the  direction  of  movement  of  the  symbol  vector,  and  the 
weighting  factor  used  in  the  symbol  change  equation. 
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Symbol  Recognition 


After  changing  the  Fourier  components  of  the  most 
confusable  symbol  to  improve  the  number  of  discrimina- 
bility  errors  associated  with  it,  the  task  of  reproduc- 
ing the  "new"  symbol  is  required  in  order  to  verify  the 
results.  Within  the  symbol  change  algorithm,  the  new 
Fourier  components  are  used  in  taking  the  inverse,  dis- 
crete, two  dimensional  Fourier  transform  of  the  filter- 
ed symbol.  Employing  a printing  subroutine  option  of 
PREVIP  (Ref  7),  the  symbol  is  then  reproduced  in  the 
spatial  domain  using  the  resulting  spatial  components 
of  the  inverse  transform. 

This  completes  the  algorithm  for  changing  one  sym- 
bol of  a given  symbol  set.  If  more  than  one  symbol 
change  is  desired,  the  changed  symbol  must  be  redigi- 
tited  and  reentered  into  the  algorithm  in  place  of  the 
old  symbol.  This  procedure  can  be  repeated  until  all 
symbols  in  the  input  symbol  set  have  distances  from 
each  other  at  least  as  great  as  the  threshold  distance. 
The  number  of  times  that  this  procedure  must  be  repeat- 
ed is  dependent  upon  the  number  of  different  pairs  of 
symbols  in  the  original  symbol  set  that  had  distances 
below  the  threshold  and  the  difference  of  each  of  these 
distances  from  the  threshold. 
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IV.  Results  ftD&  Discussion 

The  symbol  set  confusability  and  symbol  change  algo- 
rithm discussed  in  Chapter  III  was  used  to  compare  each 
of  the  four  symbol  sets  illustrated  in  Figure  2 both  be- 
fore and  after  one  symbol  deletion  and  one  symbol 
change.  Furthermore,  symbol  set  one  was  changed  by  this 
algorithm  so  that  the  Euclidian  distances  between  all 
possible  pairs  of  symbols  were  increased  to  at  least  the 
0.5  threshold  distance.  This  threshold  distance  is  an 
arbitrary  value.  The  0.5  value  was  chosen  since  larger 
distances  are  associated  with  fewer  discriminability 
errors  (Ref  10i20). 

Symbol  Confusability  Number  Results 

The  symbol  set  confusability  number  (SSC)  for  each 
of  the  four  symbol  sets  was  computed  to  determine  which 
of  the  original  symbol  sets  was  associated  with  the  few- 
est number  of  discriminability  errors.  The  SSC  repre- 
sents the  overall  suitability  or  separability  of  a sym- 
bol set  with  the  low  numbers  being  associated  with  fewer 
discriminability  errors.  The  ideal  SSC  would  be  zero 
for  an  error  free  symbol  set.  From  Table  IV,  the  re- 
sults indicated  that  symbol  set  three  was  the  best  sym- 
bol set  initially  since  the  SSC  of  0.822801  was  lower 
than  that  for  the  other  symbol  sets.  After  deleting  the 
most  oonf usable  symbol  in  each  set,  the  effect  on  the 
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Table  IV 

Symbol  Set  Confusability  Numbers  (SSC) 


Symbol 

Set 

Initial 

SSC 

SSC  after 
Symbol  Deletion 

SSC  after 
Symbol  Change 

1 

.91020? 

.883114 

.891442 

2 

1.102955 

1.072673 

1.056545 

3 

.822801 

.816180 

.842880 

4 

1.172562 

1.144084 

1.118844 

SSC  Indicated  an  improvement  in  the  overall  separability 
of  each  set. 

Upon  changing  the  most  conf usable  symbol  in  each 
set»  a comparison  of  the  resulting  symbol  set  confusa- 
bility numbers  indicated  an  improvement  in  sets  one,  two 
and  four.  However*  the  new  SSC  for  symbol  set  three  in- 
dicated a slightly  worse  symbol  set  than  the  original 
set  three  although  the  minimum  distance  from  the  most 
conf usable  symbol  increased  from  0.3025  to  0.40?4  after 
the  change.  The  reason  for  the  increase  in  the  SSC  was 
that  the  changed  symbol  was  closer  to  15  of  the  other 
symbols  in  the  set  after  the  change.  This  did  not  af- 
fect the  direction  of  movement  of  the  change  since  the 
original  distances  to  these  15  symbols  were  all  above 
the  threshold  and  since  the  original  distances  to  the 
remaining  two  symbols  increased  with  the  change.  How- 
ever, after  both  deleting  and  changing  the  most  conf us- 
able symbol,  symbol  set  three  still  had  the  lowest  SSC 
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which  indicated  that  it  was  the  set  with  the  fewest 
number  of  discriminability  errors. 

MggS  Conf usable  fihftaggfl 

The  ordered  distance  matrix  method  was  used  to  de- 
termine the  most  confusable  symbol  for  the  results  il- 
lustrated in  this  chapter  since  the  change  in  the  most 
confusable  symbol  was  intended  to  reduce  the  number  of 
errors  associated  with  pair-wise  discriminability. 

The  weighted  Fourier  component  approach  which  em- 
ploys Eq  (7)  was  used  to  change  the  most  confusable 
symbol  in  all  four  sets.  This  approach  provided  the 
most  significant,  controllable  change  in  the  symbol 
such  that  the  symbol  could  be  reproduced  and  reentered 
into  the  data  set. 

Effect  of  One  Symbol  Change . Table  V shows  the  re- 
sults of  one  symbol  change  on  each  of  the  four  given 
symbol  sets.  The  most  confusable  symbol  of  each  set  is 
illustrated  in  the  table,  its  nearest  neighbor  symbol 
is  identified,  and  the  distance  to  that  nearest  neigh- 
bor is  also  shown.  The  changed  symbol  in  reproduced 
form  is  depicted  to  the  right  of  the  original  symbol 
with  the  corresponding  nearest  neighbor  and  distance. 

Although  the  most  confusable  symbols  were  signifi- 
cantly changed  and  the  distances  to  the  nearest  neigh- 
bor increased  for  each  of  the  symbol  sets,  the  new  dis- 
tances were  still  below  the  0.5  threshold  after  repro- 
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Table  V 

of  One  Symbol  Change 
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duct ion  of  the  symbol.  This  indicated  that  the  changed 
symbol  was  not  accurately  reproduced  since  the  distance 
to  any  other  symbol  in  the  set  had  been  increased  to  0.5 
or  more  by  the  algorithm.  This  inaccuracy  was  caused  by 
the  lack  of  the  higher  frequency  components  which  con- 
tained the  detailed  information  relating  to  the  shape  of 
the  changed  symbol.  Only  the  basic  form  of  the  changed 
symbol  was  available  since  only  the  filtered  spatial 
frequency  components  were  used  in  the  symbol  change  al- 
gorithm. In  reconstructing  the  symbol,  the  details  in 
the  shape  such  as  sharp  edges  versus  curved  edges  or 
pointed  protrusions  versus  rounded  ones  were  not  accu- 
rately reproduced. 

To  explain  this  point  further,  an  application  of  the 
symbol  change  algorithm  on  the  most  conf usable  symbol  of 
symbol  set  one  is  demonstrated  through  a series  of  il- 
lustrations. The  original,  digitally  represented  symbol 
is  depicted  in  Figure  4 (black  on  a white  background). 
Pigure  5 shows  the  original  symbol  after  it  had  been 
filtered  by  the  modulation  transfer  function  of  PREVIP. 
For  easier  viewing,  the  filtered  symbols  are  shown  as 
white  on  a black  background.  After  five  symbol  changes 
using  Eq  (7),  the  "new"  symbol  in  its  filtered  form 
appeared  as  in  Pigure  6 with  a corresponding  distance  to 
its  nearest  neighbor  (RAD)  of  0.5110.  The  illustration 
in  Figure  7 shows  the  reproduced,  digitally  represented 
symbol  after  an  arbitrary  threshold  was  applied  to  the 
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Figure  5.  Filtered  NAV  Syabol 
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Figure  6.  NAV  Symbol  after  Symbol  Change  Algorithm 
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Figure  8.  "New"  Filtered  NAY  Symbol 
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Table  VI 

Comparison  of  Distances  from  NAV  Symbol 


Changed  Most 
Conf usable  Symbol 

Digitally 
Reproduced  Symbol 

Symbol 

Distance 

Symbol 

Distance 

RAD 

.5110 

RAD 

.3418 

WXS 

.5360 

WXS 

.4523 

APT 

.6160 

WXR 

.4854 

WXR 

.7152 

APT 

.6128 

VOR 

.7418 

TAC 

.6679 

TAC 

.7540 

VOR 

.7017 

ORP 

.7660 

ORP 

.7324 

AAM 

.8397 

AAM 

.7529 

HDG 

.8506 

HDG 

.7935 

EAC 

.8901 

VTC 

.8027 

VTC 

.8956 

EAC 

.8221 

TGT 

.8984 

TGT 

.8454 

SAM 

.9096 

ITX 

.8554 

ITX 

.9142 

SAM 

.8705 

AAA 

.9178 

AAA 

.9142 

FAC 

.9844 

MRP 

.9634 

MRP 

1.0516 

FAC 

1.0121 

basic  form  of  the  symbol  shown  in  Figure  6.  To  deter- 
mine how  accurately  the  basic  form  of  this  "new”  symbol 
was  interpreted,  the  filtered  version  of  the  reproduced 
symbol,  depioted  in  Figure  8,  should  be  compared  to  the 
basic  form  in  Figure  6.  Any  errors  in  interpretation 
are  easily  seen.  Also,  Table  VI  indicates  how  accu- 
rately the  changed  symbol  was  reproduced  if  one  com- 
pares the  distances  and  the  order  of  the  symbols  be- 


tween  the  column  corresponding  to  the  changed  symbol 
and  the  column  corresponding  to  the  reproduced  symbol. 
Ideally,  the  order  of  symbols  and  the  distances  to 
those  symbols  from  the  reproduced  symbol  would  be  iden- 
tical to  those  of  the  changed  symbol.  However,  the  re- 
sulting differences  between  the  symbols  are  caused  by 
the  loss  of  the  higher  frequency  detail  of  the  changed 
symbol  due  to  filtering. 

Complete  Symbol  Set  Change.  The  results  of  repeat- 
ed applications  of  the  symbol  change  algorithm  on  sym- 
bol set  one  are  shown  in  Figure  9 and  in  Table  VIII. 
Only  eight  symbols  of  the  original  18  have  been  altered 
in  order  to  achieve  the  minimum  Euclidian  distance  of 
0.5  between  all  possible  pairs  of  symbols.  Figure  9 
shows  these  eight  symbols  with  each  new  symbol  depicted 
to  the  right  of  the  original  symbol.  The  original  dis- 
tance matrix  of  symbol  set  one  is  shown  in  Table  VII. 
The  data  in  this  table  should  be  compared  with  that  of 
Table  VIII,  which  contains  the  distance  matrix  of  set 
one  after  the  eight  symbols  have  been  changed.  Only 
those  distanees  in  the  columns  and  rows  corresponding 
to  the  eight  changed  symbols  are  affected.  Those  dis- 
tances that  were  below  the  threshold  in  Table  VII  have 
been  increased  to  at  least  0.5  in  Table  VIII. 

These  results  indioate  that  the  symbol  ehange  algo- 

Crithm  made  significant  ohanges  in  the  input  symbol  set 
with  a corresponding  Increase  in  the  distances  between 
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Symbol  Original  New  Symbol  Original  New 

Code  Symbol  Symbol  Code  Symbol  Symbol 

m @ @ 

- V V 

tot  i — i 
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NAV  ( 
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- (=>  C=> 

” 0 ^ 

hdg  O £ 3 

Figure  9.  Changed  Symbols  of  Symbol  Set  One 


the  symbols.  The  algorithm  also  lowered  the  symbol  set 
oonfusability  number  from  0,910207  to  0.805384.  There- 
fore, the  number  of  discriminability  errors  associated 
with  the  symbol  set  should  be  fewer  after  the  changes. 
Thus,  the  application  of  the  symbol  ohange  algorithm 
shows  a definite  improvement  in  the  original  symbol  set. 
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V.  Conclusions  and,  Recommendations 


The  results  In  Chapter  IV  are  based  on  the  symbol 
change  algorithm  discussed  in  Chapter  III.  This  chap- 
ter reviews  this  symbol  change  algorithm  and  recommends 
some  areas  for  further  research.  Table  IX  summarizes 
the  symbol  change  algorithm  as  a part  of  the  predictor 
of  human  visual  performance  (PREVIP)  algorithm  (Ref  7). 

Symbol  Change  Algorithm 

The  symbol  change  algorithm  is  designed  to  produce 
an  optimally  human-separable  symbol  set  by  changing  one 
or  more  symbols  within  a given  set  of  symbols  in  order 
to  reduce  the  number  of  discriminability  errors  caused 
by  similarities  in  the  basic  form  or  shape  of  certain 
symbols  within  the  set.  The  symbols  that  cause  these 
confusion  errors  are  changed  one  at  a time  until  the 
Euclidian  distance  between  each  possible  pair  of  sym- 
bols is  greater  than  or  equal  to  an  arbitrary  threshold 
distance.  A threshold  distance  of  0.5  is  used  for  the 
results  of  Chapter  IV.  This  value  is  chosen  to  approx- 
imate an  error  free  distance  and  is  based  on  a review 
of  a number  of  different  alphanumeric  symbol  sets.  If 
all  of  the  distances  are  increased  to  0.5  or  greater, 
the  number  of  discriminability  errors  associated  with 
the  symbol  set  should  decrease. 

t 


mmmb  mm 
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Table  IX 

Symbol  Change  Algorithm 


1.  Simulate  each  symbol  In  digital  form  by  sampling 
the  intensity  pattern  of  the  symbol  at  a finite  number 
of  evenly  spaced  points  or  by  a comparable  method. 

2.  Transform  each  digitally  represented  symbol  into 
the  Fourier  domain. 

3.  Energy  normalize  each  transformed  symbol. 

4.  Low-pass  filter  each  transformed  symbol. 

5.  Compute  the  Euclidian  distance  between  each  pos- 
sible pair  of  filtered,  transformed  symbols. 

6.  Compute  the  symbol  set  confusability  number  (SSC) 
from  the  ordered  distance  matrix  according  to  Eq  (4). 

7.  Determine  the  most  conf usable  symbol  in  the  set 
from  the  method  using  the  ordered  distance  matrix. 

8.  Change  the  most  conf usable  symbol  by  controlled 
manipulation  of  the  Fourier  components  according  to 
Eq  (7),  the  weighted  component  approach. 

9.  Obtain  a spatial  image  of  the  changed  symbol  by 
taking  the  inverse  Fourier  transform  using  the 
changed,  filtered  Fourier  components. 

10.  Digitally  reproduce  the  changed  symbol  from  the 
spatial  image. 

11.  Repeat  steps  two  through  ten  until  the  minimum 
Euclidian  distance  between  each  possible  pair  of 
symbols  is  greater  than  or  equal  to  the  threshold 
Euclidian  distance. 
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The  symbol  that  is  changed  is  the  most  confusable 
symbol  in  the  symbol  set.  For  the  results  in  Chapter 
IV,  the  most  confusable  symbol  is  determined  from  the 
method  using  the  ordered  distance  matrix  rather  than 
the  symbol  set  confusability  matrix.  This  method  is 
used  since  the  most  confusion  is  between  those  symbols 
that  are  separated  by  the  smallest  Euclidian  distance. 

The  Fourier  components  of  the  most  confusable  sym- 
bol are  changed  by  the  weighted  component  approach  us- 
ing Eq  (7).  This  approach  provides  the  most  control- 
lable change  in  the  shape  of  the  symbol  so  that  the 
resulting  spatial  image  derived  from  the  changed  Fou- 
rier components  is  easily  interpreted  to  enable  digital 
simulation  of  the  changed  symbol.  This  new  symbol  in 
digital  form  then  replaces  the  original  symbol  and  the 
algorithm  is  repeated  until  the  minimum  Euclidian  dis- 
tance threshold  is  achieved  for  all  distances  in  the 
symbol  set. 

The  symbol  change  algorithm  effectively  improves  a 
given  symbol  set  based  on  the  increase  in  the  Euclidian 
distances.  The  symbol  set  confusability  number  (SSC), 
which  is  computed  from  the  ordered  distance  matrix 
according  to  Eq  (4),  also  provides  a measure  of  the 
relative  separability  of  a symbol  set.  A symbol  set 
with  a lower  SSC  has  fewer  discriminability  errors  as- 
sociated with  it.  Based  on  the  Euclidian  distances  of 
the  four  symbol  sets  used  in  this  study  and  the  results 
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of  Table  IV,  an  SSC  of  less  than  one  implies  that  the 
given  symbol  set  is  a comparatively  good  symbol  set 
that  would  produce  relatively  few  errors. 

Respmmenfla-frlgng  ffiT  Further  Research 

Although  a symbol  set  with  a low  SSC  implies  rela- 
tively few  confusion  errors,  the  actual  error  rate  in 
any  arbitrary  application  cannot  be  predicted  with  our 
present  knowledge.  Also,  the  hypothesized  reduction  in 
errors  after  employing  the  symbol  change  algorithm  is 
unknown.  Further  research  is  needed  to  verify  these 
results  with  actual  data  obtained  by  testing  human  sub- 
jects as  Goble  did  for  discriminability  errors  (Ref  10) 
but  in  this  case  using  symbol  sets  both  before  and  af- 
ter application  of  the  symbol  change  algorithm.  Human 
performance  data  would  provide  a more  definite  rela- 
tionship between  the  number  of  errors  and  the  SSC  so  as 
to  enable  the  number  of  errors  to  be  determined  from 
each  SSC. 

The  number  of  errors  is  also  related  to  the  Euclid- 
ian distance  between  symbols.  This  is  an  inverse  rela- 
tionship, however,  the  minimum  distance  for  an  error 
free  symbol  set  requires  further  study.  Although  a 
threshold  distance  of  0.5  was  used  for  this  particular 
project,  this  threshold  value  is  still  an  arbitrary 
number  that  is  intended  to  approximate  an  error  free 
distance.  Is  there,  in  fact,  one  such  universal  error 
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free  distance,  or  is  this  distance  dependent  upon  the 
number  of  symbols,  the  type  of  symbols  (alphanumeric, 
geometric,  or  size  variant,  for  example),  or  other  fac- 
tors? It  is  this  question  that  is  addressed  in  the  ap- 
pendix but  still  requires  more  investigation  in  order 
to  reach  an  answer. 

Another  question  for  consideration  involves  the 
definition  of  the  most  conf usable  symbol.  Would  more 
errors  be  eliminated  by  choosing  a different  most  con- 
fusable  symbol?  A second  definition  was  discussed  in 
Chapter  III.  Another  definition  for  this  symbol  would 
be  to  select  the  nearest  neighbor  to  the  currently  de- 
fined most  conf usable  symbol.  It  may  be  possible  to 
reduce  the  errors  even  further  by  selecting  a different 
order  for  the  symbols  to  be  changed.  Human  performance 
data  may  verify  that  another  definition  of  the  most 
confusable  symbol  may  better  optimize  a symbol  set. 

A different  direction  of  movement  of  the  most  con- 
fusable symbol  in  effecting  the  symbol  change  may  also 
reduce  the  number  of  errors  associated  with  the  com- 
pletely changed  symbol  set.  One  method  would  weight 
the  Fourier  components  of  all  or  part  of  the  neighbor- 
ing symbols  in  the  set.  This  method  would  slightly  al- 
ter symbol  change  Eq  (7)  by  redefining  FCnbr.  The  value 
of  FCnbr  could  be  the  average  of  the  corresponding 
Fourier  components  for  all  or  part  of  the  neighboring 
symbols.  FCnbr  could  also  be  the  average  of  these  com- 
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ponents  which  are  weighted  by  a factor  equal  to  the 
reciprocal  of  the  rank  of  each  neighboring  symbol  where 
the  rank  is  one  for  the  nearest  neighbor,  two  for  the 
next  closest  symbol  and  so  on.  A weighting  factor 
corresponding  to  one  minus  the  distance  between  the 
symbols  might  also  be  used. 

The  symbol  change  approach  which  separates  like 
Fourier  components  according  to  Eq  (9)  also  merits  fur- 
ther investigation.  If  the  energy  that  is  contributed 
to  the  spatial  background  by  the  changed  Fourier  com- 
ponents can  be  controlled  or  eliminated,  this  approach 
may  provide  a larger  reduction  in  the  number  of  dis- 
criminability  errors. 

Another  related  area  for  possible  future  study 
deals  with  the  reproduction  of  the  changed  symbol.  Can 
the  changed  symbol  be  interpreted  from  the  spatial  image 
more  accurately  by  some  other  means?  It  may  be  pos- 
sible to  redigitize  the  changed  symbol  from  the  spatial 
image  created  from  the  inverse  Fourier  transform  of  the 
filtered  components.  One  suggested  technique  would 
restrict  the  changes  in  the  symbol  to  only  neighboring 
points  or  regions  of  the  original  intensity  pattern  or 
the  previously  changed  intensity  pattern.  Another 
technique  would  not  allow  a change  if  the  correlation 
of  the  changed  spatial  image  with  the  original  or  pre- 
viously changed  spatial  image  did  not  exceed  some 
minimum  threshold. 


Although  the  approach  used  for  this  symbol  change 
algorithm  is  thought  to  be  the  best  method,  additional 
data  from  other  symbol  sets  and  human  performance  tests 
may  offer  new  insight  such  that  other  alternatives 
might  further  reduce  the  number  of  discriminability 
errors  in  a symbol  set.  These  recommended  areas  for 
further  research  may  later  provide  a more  efficient 
algorithm  to  generate  an  optimally  human-separable 
symbol  set. 
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Appendix 


malfflUB  SC£SC  Free  glg$ftng& 

The  problem  of  selecting  a suitable  threshold  Eu- 
clidian distance  for  the  symbol  change  algorithm  led  to 
the  possibility  of  discovering  some  minimum  distance 
with  which  no  confusion  errors  would  occur.  It  would 
be  desirable  to  change  the  minimum  number  of  symbols  in 
a given  symbol  set  as  is  required  to  increase  the  Eu- 
clidian distance  between  all  possible  pairs  of  symbols 
so  that  the  symbol  set  would  cause  no  discriminability 
errors.  Some  additional  research  and  results  on  this 
aspect  of  symbology  standardization  are  discussed  and 
presented  in  this  appendix. 

The  Euclidian  distance  calculations  for  symbol  set 
two  (see  Figure  2)  yielded  the  interesting  result  that 
the  pair  of  symbols  with  the  largest  distance  between 
them  (1.0444)  were  the  two  squares  NAV  and  RAD.  The 
only  difference  in  these  two  symbols  was  the  size.  The 
NAV  symbol  was  a 36x36  square  on  the  64x64  point  grid 
while  the  RAD  symbol  was  an  16x16  square. 

This  size  variation  has  previously  been  used  in 
visual  coding  schemes  and  a maximum  and  recommended 
number  of  symbols  or  coding  steps  which  a human  can 
differentiate  has  been  linked  with  various  error  rates 
(Ref  21i69-77).  Various  coding  techniques  were  inves- 


tigated  by  a research  group  headed  by  Muller  in  1955 
(Ref  15).  It  was  believed  that  the  data  gathered  by 
Muller  would  provide  some  measure  between  the  number  of 
errors  of  the  size  variant  coding  techniques  and  the 
Euclidian  distances.  Although  the  appropriate  informa- 
tion was  originally  recorded  by  Muller,  the  number  of 
errors  between  pairs  of  symbols  was  not  included  in  his 
report.  However,  he  did  report  a maximum  number  and  a 
recommended  number  of  symbols  for  each  coding  technique 
and  an  associated  error  rate  using  the  Gamer-Hake 
method  of  measuring  the  accuracy  of  stimulus  identifi- 
cation in  informational  terms  (Ref  2 1 42-6?  and  Ref  8). 
Two  of  these  coding  techniques  were  selected  for  study 
in  an  effort  to  arrive  at  a minimum  error  free  Euclid- 
ian distance. 

Magnitude  coding  and  area  coding  (Ref  21 1 73-75) 
were  investigated  using  Gagnon's  predictor  of  human 
visual  performance  (PREVIP)  algorithm  to  generate  the 
distance  matrices  (Ref  7).  A field  of  view  of  0.63  de- 
grees by  O.63  degrees  and  a size  of  interest  of  0.25 
cycles  per  degree  were  the  input  parameters  used  for 
each  of  six  symbol  sets.  The  symbols  were  digitized  on 
a 64x64  point  grid  and  were  confined  in  this  grid  to  a 
40x40  point  area  which  corresponded  to  30  minutes  of 
visual  angle  by  applying  the  field  of  view  and  size  of 
interest  input  parameters.  The  largest  symbols  were  a 
40  point  diameter  circle  which  represented  Muller's 
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area  coded  circular  blip  of  30  minutes  of  visual  angle 
(Ref  15ill-12  and  Ref  21i75)  as  well  as  his  magnitude 
coded  ellipse  of  unit  axis  ratio  (Ref  15«9-10  and  Ref 
21 1 ?4)  and  a 40x40  square  which  represented  the  largest 
symbol  in  four  additional  magnitude  and  area  coded  sym- 
bol sets.  The  latter  four  symbol  sets  were  chosen  be- 
cause the  resolution  on  the  64x64  point  grid  could  not 
accurately  represent  circles  and  ellipses  in  digital 
form,  especially  for  smaller  diameters  and  axis  ratios. 
The  square  provided  a similar  starting  point  in  size  of 
visual  angle  for  the  largest  symbol  in  each  set  but,  as 
one  or  both  dimensions  were  varied  as  a percentage  of 
the  40  point  maximum  dimension,  the  smaller  symbols  were 
simulated  more  accurately  than  similarly  smaller  symbols 
in  the  circular  or  elliptical  sets. 

These  six  symbol  sets  are  depicted  in  Figures  10 
through  15  and  the  corresponding  distance  matrices  are 
tabulated  in  Tables  X through  XV.  The  distances  were 
larger  between  the  middle  symbols  of  each  set  than  the 
distances  between  the  symbols  at  the  end  points.  The 
average  distance  between  symbols  A and  B,  B and  C,  C 
and  D,  and  D and  E in  the  three  five-symbol  sets  was 
0.6835  while  that,  following  a similar  progression,  in 
the  three  eight-symbol  sets  was  0.4452  which  yielded  an 
average  distance  of  0.5968  (four  distances  in  the  five- 
symbol  sets  and  seven  in  the  eight-symbol  sets).  The 
average  of  all  the  distances  in  the  three  five -symbol 
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* ( sets  was  0.9300  while  that  for  the  three  eight-symbol 

sets  was  0.6895  yielding  an  overall  average  distance  of 
0.866?  (ten  distances  in  each  five-symbol  set  and  28  in 
each  eight-symbol  set). 

Thus(  in  conclusion,  both  average  distances  of 
0.5968  and  0.8667  are  greater  than  the  0.5  threshold 
distance  used  in  the  symbol  change  algorithm.  Since 
errors  are  assumed  to  exist  in  these  six  symbol  sets 
based  on  Muller's  results,  does  the  threshold  distance 
of  0.5  change  the  symbol  enough?  Should  a higher 
threshold  distance  be  used  and,  if  so,  will  it  signifi- 
cantly reduce  the  number  of  errors  even  further?  Can 
each  Euclidian  distance  be  associated  with  a definite 
number  of  errors  or  is  the  number  of  errors  dependent 
upon  the  number  of  symbols  in  the  set  and/or  the  type 
of  coding  used?  And,  finally,  is  there  a universal 
error  free  distance?  It  would  be  useful  if  some  or  all 
of  these  questions  could  be  answered,  and  this  appendix 
is  written  to  provide  some  data  as  a base  for  approach- 
ing these  answers. 


Symbol i 

• 

e 

• 

• 

• 

Visual  Angle 

in  Minutest 

5 

7 

12 

21 

30 

Letter  Codei 

FCA 

FCB 

FCC 

PCD 

FCE 

Figure  10.  Five-Symbol  Set  of  Circular  Blips 


Table  X 

Distance  Matrix  for 
Five-Symbol  Set  of  Circular  Blips 
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Table  XII 

Distance  Matrix  for 
Five -Symbol  Set  of  Square  Blips 


FSA 
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FSE 
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Table  XV 

Distance  Matrix  for 
Five-Symbol  Set  of  Squares 
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